Using SVM to pre-classify government purchases
نویسنده
چکیده
The Brazilian government often misclassifies the goods it buys. That makes it hard to audit government expenditures. We cannot know whether the price paid for a ballpoint pen (code #7510) was reasonable if the pen was misclassified as a technical drawing pen (code # 6675) or as any other good. This paper shows how we can use machine learning to reduce misclassification. I trained a support vector machine (SVM) classifier that takes a product description as input and returns the most likely category codes as output. I trained the classifier using 20 million goods purchased by the Brazilian government between 1999-04-01 and 2015-04-02. In 83.3% of the cases the correct category code was one of the three most likely category codes identified by the classifier. I used the trained classifier to develop a web app that might help the government reduce misclassification. I open sourced the code on GitHub; anyone can use and modify it.
منابع مشابه
The Macroeconomic Effects of Government Asset Purchases: Evidence from Postwar US Housing Credit Policy∗
We document the portfolio activity of federal housing agencies and provide evidence on its impact on mortgage markets and the economy. Through a narrative analysis, we identify historical policy changes leading to expansions or contractions in agency mortgage holdings. Based on those regulatory events that we classify as unrelated to short-run cyclical or credit market shocks, we find that an i...
متن کاملThe Optimal Use of Government Purchases for Macroeconomic Stabilization
This paper extends Samuelson’s theory of optimal government purchases by accounting for the contribution of government purchases to macroeconomic stabilization. Using a matching model of the macroeconomy, we derive a sufficient-statistics formula for optimal government purchases. The formula implies that the deviation of optimal government purchases from the Samuelson level is proportional to t...
متن کاملDetection and Classification of Emotions Using Physiological Signals and Pattern Recognition Methods
Introduction: Emotions play an important role in health, communication, and interaction between humans. The ability to recognize the emotional status of people is an important indicator of health and natural relationships. In DEAP database, electroencephalogram (EEG) signals as well as environmental physiological signals related to 32 volunteers are registered. The participants in each video we...
متن کاملProductive Government Purchases and the Real Exchange Rate*
Empirical research documents that an exogenous rise in government purchases in a given country triggers a depreciation of its real exchange rate. This raises an important puzzle, as standard macro-theories predict an appreciation of the real exchange rate. We argue that this prediction might reflect the conventional assumption that government purchases are unproductive. Using a simple frictionl...
متن کاملDetection and Classification of Emotions Using Physiological Signals and Pattern Recognition Methods
Introduction: Emotions play an important role in health, communication, and interaction between humans. The ability to recognize the emotional status of people is an important indicator of health and natural relationships. In DEAP database, electroencephalogram (EEG) signals as well as environmental physiological signals related to 32 volunteers are registered. The participants in each video we...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- CoRR
دوره abs/1601.02680 شماره
صفحات -
تاریخ انتشار 2015